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We  describe  a  procedure  that  uses  incomplete  data  to  estimate  failure  rate 
and  survival  functions.  Although  the  procedure  is  designed  for  discrete  distri¬ 
butions,  it  applies  in  the  continuous  case  also. 

Our  description  is  expository  and  therefore  contains  no  proofs:  they  are  pro¬ 
vided  by  Mimmack  (1985) . 

The  procedure  is  based  on  the  assumption  of  a  piecewise  constant  failure 
rate.  The  resultant  survival  function  estimator  is  a  piecewise  geometric  function 
denoted  the  Piecewise  Geometric  Estimator  (PEGF.) .  The  PEGE  is  the  discrete  ver¬ 
sion  of  the  piecewise  exponential  estimators  proposed  independently  by  Kitchin, 
Langberg  and  Proschan  (1983)  and  Whittemore  and  Keller  (1983),  and  it  is  a 
generalization  of  an  estimator  of  Umholtz  (1984)  who  considers  complete  data 
taken  from  an  exponential  distribution. 


20.  ABSTRACT  (Continued) 


The  PEGE  is  consistent  and  asymptotically  normal  under  conditions  more 
general  than  those  of  the  model  of  random  censorship.  Although  the  PEGE  and  the 
widely  used  Kaplan-Meier  estimator  (KME)  are  asymptotically  equivalent  and 
generally  interlace,  the  PEGE  is  expected  to  perform  better  than  the  KME  in 
terms  of  small  sample  properties. 

The  PEGE  is  attractive  to  users  because  it  is  computationally  simple  and 
realistic  in  that  it  decreases  at  every  possible  failure  time:  it  therefore 
not  only  has  the  appearance  of  a  survival  function,  but  also  provides  a  realistic 
estimate  of  the  failure  rate  function.  The  KME,  in  contrast,  is  a  step  function. 


.  > 


Section  1 :  Introduction  and  Summary 


The  problem  of  estimating  survival  probabilities  from  incomplete  data  is 
well  known  in  the  fields  of  reliability,  medicine,  biometry  and  actuarial  science. 
The  general  situation  is  described  as  follows.  The  variable  of  interest  is  the 
lifespan  of  some  unit:  the  investigator  wishes  to  estimate  the  probability  of 
survival  beyond  any  given  time.  To  this  end,  n  identical  units  are  placed 
"on  test".  Each  item  is  either  observed  until  failure,  resulting  in  an  uncen¬ 
sored  observation,  or  is  removed  from  the  test  before  failure,  resulting  in  a 
censored  observation.  Thus  the  data  available  consist  of  a  number  of  lifelengths 
and  a  number  of  truncated  lifelengths:  the  statistical  problem  is  to  estimate 

the  probability  distribution  of  the  lifelengths. 

The  various  statistical  approaches  to  the  problem  can  generally  be  classified 
according  to  the  restrictiveness  of  the  model  assumed  and  the  type  of  information 
utilized.  At  one  extreme  are  purely  parametric  procedures,  which  involve 
assuming  that  the  underlying  life  distribution  belongs  to  a  specific  parametric 
family.  These  procedures  utilize  interval  information.  The  Bayesian  estimator 
described  by  Susarla  and  Van  Ryzin  (1976)  makes  allowance  for  both  parametric 
and  nonparametric  models:  the  type  of  information  utilized  depends  on  the 
assumptions  about  the  prior  distribution.  As  our  approach  to  the  problem  is 
neither  parametric  nor  Bayesian,  we  do  not  consider  these  procedures  further  but 
concentrate  on  nonparametric  procedures. 

Nonparametric  procedures  range  in  sophistication  from  the  well-known 
actuarial  estimator  *  which  is  a  step  function  constructed  from  ordinal  infor¬ 
mation  alone,  to  the  piecewise  polynomial  estimators  of  Whittemore  and  Keller 
(1983)  that  utilize  interval  information.  The  most  widely  used  nonparametric 
estimators  are  those  of  Kaplan  and  Meier  (1958)  and  Nelson  (1969).  These 
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ABSTRACT 

We  describe  a  procedure  that  uses  incomplete  data  to  estimate  failure  rate 
and  survival  functions.  Although  the  procedure  is  designed  for  discrete  distri¬ 
butions,  it  applies  in  the  continuous  case  also. 

Our  description  is  expository  and  therefore  contains  no  proofs:  they  are 
provided  by  Mimmack  (1985). 

The  procedure  is  based  on  the  assumption  of  a  piecewise  constant  failure 
rate.  The  resultant  survival  function  estimator  is  a  piecewise  geometric  function 
denoted  the  Piecewise  Geometric  Estimator  (PEGE) .  The  PEGE  is  the  discrete  ver¬ 
sion  of  the  piecewise  exponential  estimators  proposed  independently  by  Kitchin, 
Langberg  and  Proschan  (1983)  and  Whittemore  and  Keller  (1983) ,  and  it  is  a 
generalization  of  an  estimator  of  Umholtz  (1984)  who  considers  complete  data 
taken  from  an  exponential  distribution. 

The  PEGE  is  consistent  and  asymptotically  normal  under  conditions  more 
general  than  those  of  the  model  of  random  censorship.  Although  the  PEGE  and  the 
widely  used  Kaplan-Meier  estimator  (KME)  are  asymptotically  equivalent  and 
generally  interlace,  the  PEGE  is  expected  to  perform  better  than  the  KME  in 
terms  of  small  sample  properties. 

The  PEGE  is  attractive  to  users  because  it  is  computationally  simple  and 
realistic  in  that  it  decreases  at  every  possible  failure  time:  it  therefore  not 
only  has  the  appearance  of  a  survival  function,  but  also  provides  a  realistic 
estimate  of  the  failure  rate  function.  The  KME,  in  contrast,  is  a  step  function. 


estimators  are  also  step  functions  constructed  from  ordinal  information.  Their 
properties  are  described  by  Efron  (1967),  Breslow  and  Crowley  (1974),  Petersen 
(1977),  Aalen  (1976,  1978),  Kitchin,  Langberg  and  Proschan  (1983),  Nelson  (1972), 
Fleming  and  HarTington  (1979),  and  Chen,  Hollander  and  Langberg  (1982). 

One  of  the  by-products  of  the  estimation  process  is  an  estimate  of  the 
failure  rate  function:  here,  another  issue  is  raised.  It  is  evident  that  sur¬ 
vival  function  estimators  that  are  step  functions  do  not  provide  useful  failure 
rate  function  estimators:  Miller  (1981)  mentions  smoothing  the  Kaplan-Meier 
estimator  for  this  reason  and  summarizes  the  development  of  other  survival  func¬ 
tion  estimators  that  may  be  obtained  by  considering  a  special  case  of  the  regres¬ 
sion  model  of  Cox  (1972),  These  estimators  generally  correspond  to  failure  rate 
function  estimators  that  are  step  functions  and  utilize  at  most  part  (but  not  all) 
of  the  interval  information  contained  in  the  data.  Whittemore  and  Keller  (1983) 
give  several  more  refined  failure  rate  function  estimators  that  are  step  functions 
and  utilize  full  interval  information.  They  also  describe  even  more  complex 
estimators  that  utilize  full  interval  information:  however,  these  are  not  compu¬ 
tationally  convenient  compared  with  their  simpler  estimators.  It  seems,  from 
their  work,  that  a  successful  rival  of  the  Kaplan-Meier  estimator  should  be  only 
marginally  more  complex  than  it  (so  as  to  be  computationally  convenient  and  yet 
yield  a  useful  failure  rate  function  estimator)  and  also  should  utilize  more  than 
ordinal  information. 

In  Section  2,  we  propose  an  estimator  that  not  only  provides  a  reasonable 
failure  rate  function  estimator  but  also  utilizes  interval  information.  More¬ 


over,  it  is  computationally  simple.  Our  estimator  is  a  discrete  counterpart  of 
two  versions  of  a  continuous  estimator  proposed  independently  by  Kitchin,  Langberg 
and  Proschan  (1983)  and  Whittemore  and  Keller  (1983).  The  motivation  for  the 
construction  of  our  estimator  is  the  same  as  that  of  the  former  authors,  and  our 
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model  is  the  discrete  version  of  theirs:  in  contrast,  the  latter  authors  assume 
the  more  restrictive  model  of  random  censorship  and  obtain  their  estimator  by 
the  method  of  maximum  likelihood.  This  provides  an  alternative  method  of  deriv¬ 
ing  our  estimator. 

The  remaining  sections  are  concerned  with  properties  of  our  estimator.  As 
this  presentation  is  expository,  proofs  are  omitted:  Mimmack  (1985)  provides 
proofs. 

In  Section  3,  we  explore  the  asymptotic  properties  of  our  estimator  under 
increasingly  restrictive  models.  Our  estimator  is  strongly  consistent  and 
asymptotically  normal  under  conditions  more  general  than  those  typically  assumed. 

Section  4  deals  with  the  relationships  among  our  estimator,  the  Kaplan- 
Meier  estimator,  and  the  above-mentioned  estimator  of  Kitchin  et  al.  and 
Whittemore  and  Keller.  The  section  ends  with  an  example  using  real  data. 

In  Section  5,  we  continue  the  comparison  of  the  new  estimator  and  the 
Kaplan-Meier  estimator:  since  the  properties  of  the  new  estimator  are  expected 
to  resemble  those  of  its  continuous  counterparts,  we  discuss  the  implications 
of  simulation  studies  designed  to  investigate  the  small  sample  behaviour  of 
these  estimators.  We  also  present  the  results  of  a  Monte  Carlo  pilot  study 
designed  to  investigate  the  small  sample  properties  of  our  estimator. 


Section  2:  Preliminaries. 


In  this  section  we  formulate  the  problem  in  statistical  terms  and  define 
our  estimator. 

Let  X  denote  the  lifelength  of  a  randomly  chosen  unit,  where  X  has  distri¬ 
bution  function  G.  Suppose  that  n  identical  items  are  placed  on  test.  The 
resultant  sample  consists  of  the  pairs  (Z^,6^),  ...,  (Zn ,^n) »  where  represents 
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the  time  for  which  unit  i  is  observed  and  6.^  indicates  whether  unit  i  fails 
while  under  observation  or  is  removed  from  the  test  before  failure.  Symbolically, 
for  i  =  1,  . n,  we  have: 

=  life  length  of  unit  i,  where  has  distribution  G, 

Y^  =  time  to  censorship  of  unit  i, 
li  =  min(Xi,Yi), 

6.  =  I(X.SY.). 

^l'Yp*  (*n .Y^)  are  assumed  to  be  independent  random  pairs.  Elements 

of  a  pair  X..  and  Y^,  where  i  =  1,  ...,  n,  are  not  assumed  to  be  independent. 

We  assume  that  the  lifelength  and  censoring  random  variables  are  discrete. 

Let  X  s  denote  the  set  of  possible  values  of  X  and  Y  * 

denote  the  union  of  the  sets  of  possible  values  of  Yj,  Y^,  . ..,  where  V  £.  X. 

The  survival  probabilities  of  interest  are  denoted  P(X>xk),  k  =  1,  2,  .... 
where  P(X>xk)  =  GC^)  »  1  -  G^)  ,  k  =  1,  2,  ...  . 

It  is  evident  that  this  formulation  differs  from  that  of  the  model  of 
random  censorship  which  is  generally  assumed  in  the  literature,  and  in  parti¬ 
cular,  by  Whittemore  and  Keller  (1983).  These  authors  assume  that  the  lifelength 
and  censoring  random  variables  are  continuous,  that  the  corresponding  pairs  X^ 
and  Y^,  where  i  =  1,  2,  ...,  are  independent,  and  that  the  censoring  random  vari¬ 
ables  are  identically  distributed.  Although  Kaplan  and  Meier  (1958)  assume 
only  independence  between  corresponding  lifelength  and  censoring  random  variables, 
Breslow  and  Crowley  (1974),  Petersen  (1977),  Aalen  (1976,  1978),  and  others  -- 
all  of  whom  describe  the  properties  of  the  Kaplan-Meier  estimator  --  assume  also 
that  the  censoring  random  variables  are  identically  distributed.  Our  formulation 
is  the  discrete  counterpart  of  that  of  Kitchin,  Langberg  and  Proschan  (1983): 
likewise,  our  estimator  is  the  discrete  counterpart  of  theirs. 
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Before  describing  our  estimator,  we  give  the  notation  required. 


Let  n^  be  the  random  number  of  distinct  uncensored  observations  in  the 


sample  and  let  t,  <  t,  <  ...  <  t  denote  these  distinct  observed  failure 
i  t  n  j 


times,  with  t^  =  0.  Let  n^  be  the  random  number  of  distinct  censored  obser¬ 


vations  in  the  sample  and  let  s.  <  s_  <  ...  <  s  denote  these  times,  with 

1  2  n2 


s0  H  °* 


Let  D.  be  the  number  of  failures  observed  at  time  t. : 

i  l 


D.j  =  l  I  (Z  . =t . ,  5  .  =  1)  for  i  =  1,  ....  n.. 
1  j*l  3  1  3  1 


Let  be  the  number  of  censored  observations  equal  to  s^: 


*  » 

:z  =  l  I(Z.=s.,  6  .=0)  for  i  =  1,  ....  n_ . 
1  j=l  J  1  J  L 


Let  Fn(t)  =  1  -  Fn(t)  denote  the  proportion  of  observations  that 
exceed  t : 


,  n 

*L(t)  55  £  l  KZ.>t)  for  t  e  [0.-). 


n  •  i  3 

3  =  1  J 


Let  Fn(t)  denote  the  proportion  of  failures  observed  at  or  before  t 


1  1  n 

n(t)  =  ^  6i  =  1)  for  t  ^  C0,«). 

j  =  l  ■’  J 


Let  T\  be  a  measure  of  the  total  time  on  test  in  the  interval 


Ti  =  Vl  ‘  W  (nPn(ti>  *  Di> 


k:t.  ,<s.<t. 
i-l  k  i 


#{m:  for  i  =  1,  ...,  n^ , 


where  #A  denotes  the  cardinality  of  the  set  A. 
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r!f  failure  and  censoring  random  variables  are  lattice  random  variables,  ther. 

T\  is  the  total  time  on  test  in  (t^  j,t^].  *n  Senera^»  however,  T\  increases 

by  one  unit  whenever  an  item  on  test  survives  an  interval  of  the  form 

(Xj  where  t.  1  <  )c.  l  <  xj  “  ti»  irrespective  of  the  distance  between 

x.  ,  and  x. .) 

3-1  J 

We  now  construct  our  estimator.  Expressing  the  survival  function  G  in 
terms  of  the  failure  rates  P(X  =  Xjjx>xk),  k  =  1,  2,  we  have: 

k 


(2.1) 


P(X>xk)  =  JTCl  -  P(X  =  x.|x>  x.))  for  k  =  1,  2,  ... 
j=l  1  -1 


It  is  evident  from  (2.1)  that  we  may  estimate  our  survival  function  at 
x^  from  estimates  of  the  failure  rates  at  Xj,  x^,  ...,  x^.  In  the  experimental 
situation,  failures  are  not  observed  at  all  the  times  x^  x^,  ...  so  specific 
information  about  the  failure  rates  at  many  of  the  possible  failure  times  is 
not  available.  Having  observed  failures  at  t^,  t^,  ...»  tn  ,  we  find  it 
simple  to  estimate  the  failure  rates  P(X =  t^ | Xst^ ,  i  =  1,  ...,  n^.  However, 
the  question  of  how  to  estimate  the  failure  rates  at  the  intervening  possible 
failure  times  requires  special  consideration. 

One  approach  --  that  of  Kaplan  and  Meier  (1958),  Nelson  (1969)  and 
others  --  is  to  estimate  the  failure  rates  at  these  intervening  times  as  zero 
since  no  failures  are  observed  then.  However,  not  observing  failures  at 
some  possible  failure  times  may  be  a  result  of  being  in  an  experimental  situ¬ 
ation  rather  than  evidence  of  very  small  failure  rates  at  these  times,  so  we 
discard  this  approach  and  consider  nonzero  estimates. 


It  is  reasonable  to  assume  that  the  underlying  process  possesses  an 
element  of  continuity  in  that  adjacent  failure  rates  do  not  differ  radically 
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from  one  another.  Thus  we  consider  using  the  estimate  of  the  failure  rate  at 
t^  to  estimate  the  failure  rate  at  each  of  the  possible  failure  times  between 
t^  j  and  t^,  where  i  =  1,  ....  n^.  We  are  therefore  assuming  that  our  approx¬ 
imating  distribution  has  a  constant  failure  rate  between  the  times  at  which 
failures  are  observed  --  that  is: 


(2.2)  P(X=xk|X>xR)  =  qA  for  t._j  <  ^  s  t.,  i  *  1,  ....  n^ 


where 


qi  =  P(X=ti|X>ti)  for  i  =  1,  ...,  n^ 


Substituting  (z.2)  into  (2.1),  we  obtain: 

A  #{m:t.  ,<x  %)  i-1  #(m:t.  ,<x<t.) 

(2.3)  P(X>x  )  =  (1-q  )  1-1  TT(l-q.)  3-1  m  3 

K  1  j»l  3 


for  t._x  <  xk  <  t.,  i  =  1,  ...,  nj. 


Ne  note  that  the  property  of  having  constant  failure  rate  on  X  character¬ 
izes  a  family  of  geometric  distributions  defined  on  X.  In  particular,  the 
failure  rates  ...,  q^  identify  n^  geometric  distributions  Gj ,  ..., 

defined  on  X.  The  survival  functions,  ,  . . . ,  Gn  ,  have  the  geometric  form: 

(2.4)  G.(xk)  =  (1-q^. )  ^  for  k  =  1,  2,  ...  and  i  =  1 ,  ...,  n^. 

Inspection  of  (2.3)  and  (2.4)  reveals  that  our  estimating  function  is 
constructed  from  the  geometric  survival  functions  ,  ...,  Gn  ,  where  G^  is  used 
in  the  interval  (t^_j,t^],  i  =  1,  ...,  n^.  Consequently,  the  estimator  (2.3) 
is  called  the  Piecewise  Geometric  Estimator  (PEGE) . 


It  remains  to  define  estimators  of  the  failure  rates  q,,  ....  a 

nl 

This  was  originally  done  by  separately  obtaining  the  maximum  likelihood 
estimators  of  the  parameters  of  nj  truncated  geometric  distributions:  the  pro¬ 
cedure  is  outlined  at  a  later  stage  because  it  utilizes  the  geometric  structure 
of  (2.3)  and  therefore  provides  further  motivation  for  the  name  "PEGE".  A 
more  straightforward  but  less  appealing  approach  is  to  obtain  the  maximum  like¬ 
lihood  estimates  of  q. . q  directly:  denoting  by  L  the  likelihood  of 

l  nx 

the  sample,  we  have: 


1  D. 

TT  CP(X=t.)]  1 * 
i=l  1 


TT cp(x>s.)]  3 

3  =  1  3 


Substituting  (2.3)  into  this  expression  and  differentiating  yields  the  unique 
maximum  likelihood  estimates 


q.  =  D.  /T.  ,  i  =  1 ,  . . .  ,  n.  . 
xii  1 


Substituting  q .  q  into  (2.3),  we  finally  obtain  our  estimator,  formally 

l  ^ 

defined  as  follows. 


Definition  2.1.  The  Piecewise  Geometric  Estimator  (PEGE)  of  the  survival 
function  of  the  lifelength  random  variable  X  is  defined  as  follows: 


for  x,  s  0  or  n  =  0. 
k  1 


#{m:t .  .  <x  ^x,  }  i-1 

(1-D/T)  1-1  m  K  “fTO-W 

j  =  l  3  3 


#{m:t .  ,<x  St.) 
3-1  m  3 


P(X>xk)  = 


for  t._1  <  <  t.,  i  =  1,  ...,  iij,  n1  >  0. 


1  #{m:t.  <xm<t.) 

TT  (1-D  /T  )  3'1  m  3 

3  =  1  3  3 

for  xk  >  nj  >  0. 


0 


The  alternative  derivation  of  the  PEGE  emphasizes  its  geometric  structure. 


it  turns  out  that  q^, 


, qn^  defined  above  are  maximum  likelihood  estimators 


of  the  parameters  of  the  truncated  geometric  distributions  G*  G*  defined 

l  nl 

below. 


For  i  =  1,  . . . ,  n^  we  formulate  the  following  definitions: 

Let  =  #{m:  be  the  number  of  possible  times  of  failure  in 

the  interval  (t.^,  t^3  and  let  X?  be  the  number  of  possible  times  of  failure 
that  a  unit  of  age  t^  ^  survives  --  that  is. 


X?  =  number  of  trials  to  failure  of  a  unit  of  age  t^ 

where  the  possible  values  of  X?  are  assumed  to  be  1,  2,  ...,  N^,  N?.  The 
distribution  G?  of  X?  is  then  given  by: 

G|(k)  =  (l-q.)k  for  k  =  1,  2,  ....  N., 

G?(Nt)  =  0. 


The  information  available  for  estimating  qi  consists  of  nF^^  p 

observations  on  X*:  of  these,  D.  are  equal  to  N. ,  nF  (t.)  are  equal  to  Nt, 
i  l  '  i  n  l  n  i' 

and  for  all  s^  in  the  interval  (t^  ^,tp,  are  equal  to  the  number 

#{m:  t.  ,<x  £s.}.  The  resultant  maximum  likelihood  estimator  of  q.  is 
l-l  m  j 

precisely  q^  defined  above. 

It  is  evident  that  the  estimators  q.,  ...,  q„  have  the  form  of  the  usual 

1*  nn^ 

maximum  likelihood  estimator  of  a  geometric  parameter  —  that  is. 


Estimated  failure  rate  = 


number  of  failures  observed 
total  time  on  test 


Moreover,  we  note  that  this  is  the  form  of  the  failure  rate  estimators  in  the 

intervals  (tn,t.],  ...»  (t  ,»)  defined  for  the  Piecewise  Exponential  Estimator 
u  i  n  ^ 

(PEXE)  of  Kitchin,  Langberg  and  Proschan  (1983).  In  terms  of  our  notation 


(modified  for  continuity),  the  PEXE  is  defined  as  follows: 
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(2.5)  P*(X>t) 


1  for  t  <  0  or  n^  =  0 , 

i-1 

exp[-(t-t.  .)%. ]  TT exp[-(t.-t .  .)£  ] 

1  j=i  J  J"1  J 

for  t^  <  t  s  t^,  i  =  1,  ....  n^,,  >  0, 


X 

UexpC -(t.-t.  t)x  ; 
j=l  1  j  1  3 


for  t  >  t  ,  n,  >  0, 

nj'  1  ’ 


where 

X^  =  1/Yi  for  i  =  1,  . n., 

Yi  =  ^  nFn(u)du  for  i  =  1,  ...»  n,. 

ti-l 


For  i  =  1,  . . nj,  X^  is  the  failure  rate  in  the  interval  (t^  j,^]  and  y i  is 
the  total  time  on  test  in  this  interval. 

The  PEXE  is  a  piecewise  exponential  function  because  its  construction 
is  based  on  the  assumption  of  constant  failure  rate  between  observed  failures: 
just  as  a  constant  discrete  failure  rate  characterizes  a  geometric  distribution 
so  a  constant  continuous  failure  rate  characterizes  an  exponential  distribution. 
Thus  the  PEGE  is  the  discrete  counterpart  of  the  PEXE. 

Returning  to  our  introductory  discussion  about  the  desirable  features 
of  survival  function  estimators,  we  now  compare  the  PEGE  with  other  estimators 
in  terms  of  these  and  other  features. 

First,  the  PEGE  is  intuitively  pleasing  because  it  reflects  the  continuity 
inherent  in  any  life  process.  The  Kaplan-Meier  and  other  estimators  that  are 
step  functions  do  not  have  this  property. 


Second,  we  note  that  the  PEGE  utilizes  interval  information  from  both 
censored  and  uncensored  observations.  It  is  therefore  more  sophisticated  than 
the  Kaplan-Meier  and  Nelson  estimators.  Moreover,  none  of  the  estimators  of 
Whittemore  and  Keller  utilizes  more  information  than  does  the  PEGE. 

Third,  the  PEGE  provides  a  simple,  useful  estimator  of  the  failure  rate 
function.  While  this  estimator  is  naive  compared  with  the  nonlinear  estimators 
of  Whittemore  and  Keller,  the  PEGE  has  the  advantage  of  being  simple  enough  to 
calculate  by  hand  --  moreover  it  requires  only  marginally  more  computational 
effort  than  does  the  Kaplan-Meier  estimator. 

Regarding  the  applicability  of  the  PEGE,  we  note  that  use  of  the  PEGE 
is  not  restricted  to  discrete  distributions  because  it  can  be  easily  modified 
by  linear  interpolation  or  by  being  defined  as  continuous  wherever  necessary. 
This  is  theoretically  justified  by  the  fact  that  the  integer  part  of  an  exponen¬ 
tial  random  variable  has  a  geometric  distribution:  by  defining  the  PEGE  to  be 
continuous,  we  are  merely  defining  a  variant  of  the  PEXE.  The  properties  of 
this  estimator  follow  immediately  from  those  of  the  PEXE. 

Finally,  apart  from  being  intuitively  pleasing,  the  form  of  the  PEGE 
allows  reasonable  estimates  of  both  the  survival  function  and  its  percentiles. 
The  Kaplan-Meier  estimator  is  known  to  overestimate  because  of  its  step  function 
form.  We  show  in  a  later  section  that  the  PEGE  tends  to  be  less  than  the 
Kaplan-Meier  estimator,  and  therefore  the  PEGE  may  be  more  accurate  than  the 
Kaplan-Meier  estimator.  Whittemore  and  Keller  give  some  favourable  indications 
in  this  respect.  They  define  three  survival  function  estimators  that  have 
constant  failure  rate  between  observed  failure  times.  One  of  these  is  the 
PEXE,  modified  for  ties  in  the  data:  the  form  of  the  failure  rate  estimator 
is  the  same  as  the  form  of  the  PEGE  failure  rate  estimator  —  specifically, 
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for  i  =  1,  . . . ,  n. 


(2.6)  Estimated  failure  rate  in  (t.  ,  ,t.]  = 


number  of  failures  observed  at  t. 

i 


i-1’  i  total  time  on  test  during  (t^  j.t^] 

The  second  of  these  estimators  is  defined  instead  on  intervals  of  the  form 
Cti-l»V ;  ^or  i  =  n!»  ^e  failure  rate  estimator  has  the  form: 


(2.7)  Estimated  failure  rate  in  [t.  .,t.)  = 


number  of  failures  observed  at  t^  j 


i-1*  i  total  time  on  test  during  [t^  ^.t-j 

The  third  of  these  estimators  is  obtained  from  the  average  of  the  two  failure 
rate  estimators  described  by  (2.6)  and  (2.7). 

In  a  simulation  study  to  investigate  the  small  sample  properties  of  these 
three  estimators,  Whittemore  and  Keller  find  that  the  first  estimator  tends  to 
underestimate  the  survival  function  while  the  second  tends  to  overestimate  the 
survival  function.  From  these  results,  we  expect  the  PEGE  to  underestimate  the 
survival  function  and  its  percentiles.  Whittemore  and  Keller  do  not  record 
further  results  for  the  first  two  estimators:  however,  they  do  indicate  that, 
in  terms  of  bias  at  extreme  percentiles,  variance  and  mean  square  error,  the 
third  estimator  tends  to  be  better  than  the  Kaplan-Meier  estimator. 

The  implications  for  the  discrete  version  of  the  third  estimator  are  that, 
in  terms  of  bias,  variance  and  mean  square  error,  it  will  compare  favourably  with 
the  Kaplan-Meier  estimator.  An  unanswered  question  is  whether  the  performance 
of  this  estimator  is  so  superior  to  the  performance  of  the  PEGE  as  to  warrant 
the  additional  computational  effort  required  for  the  former. 


Section  3:  Asymptotic  Properties  of  the  PEGE. 

This  section  treats  the  asymptotic  properties  of  the  PEGE  and  of  the 
corresponding  failure  rate  function  estimator.  The  properties  of  primary 


interest  are  those  of  consistency  and  asymptotic  normality:  secondary  issues 
are  asymptotic  bias  and  asymptotic  correlation. 

Initially  considering  a  very  general  model,  we  obtain  the  limiting  function 
of  the  PEGE  and  show  that  the  sequences  {^n(X>xk)  }^-l  ^n^X=Xk  ^Xixk^k=l 
converge  in  distribution  to  Gaussian  sequences.  We  then  explore  the  effects  of 
making  various  assumptions  about  the  lifelength  and  censoring  random  variables. 
Under  the  most  general  model,  the  PEGE  is  not  consistent  and  the  failure  rate 
estimators  are  not  asymptotically  uncorrelated:  a  sufficient  condition  for 
consistency  is  independence  between  corresponding  lifelength  and  censoring 
random  variables,  and  a  sufficient  condition  for  asymptotically  independent 
failure  rate  estimators  is  that  the  censoring  random  variables  be  identically 
distributed.  However,  it  is  not  necessary  to  impose  both  of  these  conditions 
in  order  to  ensure  both  consistency  and  asymptotic  independence  of  the  failure 
rate  estimators:  relaxing  the  condition  of  independent  lifelength  and  censoring 
random  variables,  we  give  conditions  under  which  both  desirable  properties  are 
obtained. 

Before  investigating  the  asymptotic  properties  of  the  PEGE,  we  describe 
the  theoretical  framework  of  the  problem,  give  some  notation,  and  present  a 
preliminary  result  that  facilitates  the  exploration  of  the  asymptotic  properties 
of  the  PEGE. 

The  probability  space  (ft,A,P)  on  which  all  of  the  lifelength  and  censoring 
random  variables  are  defined  is  envisaged  as  the  infinite  product  probability 
space  that  may  be  constructed  in  the  usual  way  from  the  sequence  of  probability 
spaces  corresponding  to  the  sequence  of  independent  random  pairs  (Xj,Y^), 
(X2,Y2),  ...  .  Thus  ft  consists  of  all  possible  sequences  of  pairs  of  outcomes 
corresponding  to  pairs  of  realizations  in  Xx^:  the  first  member  of  each  pair 


■ 


corresponds  to  failure  at  a  particular  time  and  the  second  member  of  each  pair 
corresponds  to  censorship  at  a  particular  time  --  that  is,  for  each  u  in  Q, 
k  =  1,  2,  ...  and  j  =  1,  2,  ...: 

(Xi.Y.Ku)  =  (X.(u)),Y.(a>))  =  (xk,yj)  if  the  ith  element  of  the 

infinite  sequence  w  is  the  pair  of  outcomes  corresponding  to 
failure  at  and  censorship  at  Yy 

The  argument  w  is  omitted  wherever  possible. 

Two  conditions  are  imposed  on  the  random  pairs  (X^.Y^,  ^.Y^,  ...: 
(Al)  There  is  a  distribution  function  F  such  that 

i  n 

limi  l  P(Z.Sx.)  =  FCx^  for  k  =  1,  2,  ...  . 

n-x*>  i=i 

(A 2)  There  is  a  subdistribution  function  F1  such  that 

i  n  1 

lim  i-  l  PCZ.Sx^  6.=1)  =  Fi(xk)  for  k  =  1,  2,  ...  . 
n-*»  i=l 

It  is  evident  that  a  sufficient  condition  for  (Al)  and  (A2)  is  that  the 
censoring  random  variables  be  identically  distributed. 

Definitions  of  symbols  used  in  this  section  are  given  below.  Assumptions 
(Al)  and  (A2)  ensure  the  existence  of  the  limits  defined. 


Let  Pki  =  p(Zi=xk,  6^1)  for  k  =  1,  2,  ...  and  i  =  1,  ...,  n, 
**ki  =  P^Zi=xk’  for  h  =  1,  2,  ...  and  i  =  1,  ....  n, 

pk  ■ llm  r  jAi  *  pl(VFltVi)  fork’1-2 . 

n->®  i=l 


lim  i  l  R.  .  for  k  =  1,  2,  .. . 
n-*»  n  i=i  K1 
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The  proposition  below  is  fundamental:  it  asserts  that,  with  probability 
one,  as  the  sample  size  increases  to  infinity,  at  least  one  failure  is  observed 
at  every  possible  value  of  the  lifelength  random  variable.  First,  we  need  a 
definition. 


Definition  3.1.  Let  ft*  c  a  be  the  set  of  infinite  sequences  which  contain,  for 
each  possible  failure  time,  at  least  one  element  corresponding  to  the  outcome  of 
observing  failure  at  that  time  —  that  is, 

ft*  =  {to:  (vk)(3n)  Xn(uj)  =  xR,  Yn(a>)  *  XjJ  . 

Proposition  3.2.  P(ft*)  =  1. 


The  proposition  is  proven  by  showing  that  the  set  of  infinite  sequences 
that  do  not  contain  at  least  one  element  corresponding  to  the  outcome  of 
observing  failure  at  each  possible  failure  time  x^  has  probability  zero  -- 
that  is, 


tli!  i2i<Xi=v 


=  0  for  k  =  1,  2, 


As  the  pairs  (X2,Y2),  ...  are  independent,  this  is  equivalent  to 

proving  the  following  equality: 


n 

(3.1)  lim  Jf  (1  -  P(X.=x,,  Y.^x,))  =  0  for  k  =  1,  2,  . . .  . 

n-*»  i=l  1  1 

00  00 

Since  JJ”  (1-p^)  =  0  if  and  only  if  £  =  ®,  where  (p^}^ai  is  any  sequence  of 


probabilities,  and  since  (A2)  implies  that 
00 

1  p(Xi=xfc»  =  oo  for  k  =  1,  2,  ...,  we  have  (3.1). 

i=l 
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The  importance  of  the  preceding  proposition  lies  in  the  simplifications 
it  allows.  It  turns  out  that,  on  ft*  and  for  n  large  enough,  the  PEGE  may 
be  expressed  in  simple  terms  of  functions  that  have  well-known  convergence 
properties.  Since  P(fi*)  =  1,  we  need  consider  the  asymptotic  properties  of 
the  PEGE  on  ft*  alone:  these  properties  are  easily  obtained  from  those  of  the 
well-known  functions. 

In  order  to  express  the  PEGE  in  this  convenient  way,  we  view  the  estimation 
procedure  in  an  asymptotic  context. 

Suppose  a)  is  chosen  arbitrarily  from  ft*.  Then,  for  each  k,  there  is  an 
N  (depending  on  k  and  u)  such  that  X^(w)  =  x.  and  Y^w)  >:  for  j  =  1  ,  ..., 
k  and  some  i  S  N.  Consequently,  for  n  ^  N,  the  smallest  k  distinct  observed 


failure  times  tj. 


t^  are  merely  x^, 


. ..,  x,  ,  and,  since  the  set  of  possible 


censoring  times  is  contained  in  X,  the  smallest  k  distinct  observed  times  are 


also  Xj,  ....  x^.  The  first  k  intervals  between  observed  failure  times  are 
simply  (O.Xj],  (Xj.x^,  ...,  an<*  the  function  T^  n  defined  on  the 

i**1  interval  is  given  by  the  number  of  units  on  test  just  before  the  end  of  the 
ith  interval  --  that  is. 


(3.2)  T.  =  nF  (x7)  =  nF  (x.  .)  for  i  =  1 ,  ...»  k  and  n  2:  N. 

lyll  Ill  Tl  X  1 

Likewise,  we  express  the  function  n  defined  on  the  ith  interval  in  terms 
of  the  empirical  subdistribution  function  F^  as  follows. 

(3.3)  Di  n  =  n^Fn(xj)  '  Fn^xi  j_)3  for  i  =  1»  •••»  k  and  n  2  N. 


As  the  PEGE  is  a  function  of  D.  and  T.  ,  it  can  be  expressed  in  terms 

i,n  i,n*  v 

of  the  empirical  functions  F^  and  F^.  Specifically,  on  ft*,  for  any  choice  of 
k,  there  is  an  N  such  that 


for  n  £  N. 


P„(X>xk)  =  TT(i- 


k  r  F'cx.j-F'u..,) 


Consequently,  taking  the  limit  of  each  side  and  using  Proposition  3.2,  we 
have : 


lim 

n-*» 


k 

lim  TT 

n-*»  i=l 


1  - 


F1(x.)-F1(x.  ,) x 
n  1  nv  i-l'  ' 

F  (x.  ,) 
nv  i-l' 


for  k  =  1,  2,  , , 


In  exploring  the  asymptotic  behaviour  of  the  PEGE,  therefore,  we  consider 
the  behaviour  of  the  limiting  sequence  of  the  sequence 


' 

k  f 

F1(x.)-F1(x.  .)> 

1 

TT 

i=l  v 

j  n  l  n  i-l 

The  proofs  of  the  results  that  follow  are  omitted  in  the  interest  of 
brevity.  The  most  general  model  we  consider  is  that  in  which  only  conditions 
(Al)  and  (A2)  are  imposed.  The  following  theorem  identifies  the  limits  of 
the  sequences  {Pn(X=XjJ  X^x^)  }”_j  and  {^(X^)  }*  for  k  =  1,  2,  ...  and 

A  <  00  A  00 

establishes  that  the  sequences  {PnCX=x^{  and  ^Pn^X>xk^k-l  conver8e  to 

Gaussian  sequences. 


Theorem  3.3. 

(i)  With  probability  1, 

F^O-F^x  ) 

lim  P  (X=xk|XSx.)  =  - £ - £-5-  for  k  =  1,  2,  ...  . 

n-K°  p(xk-l5 


(ii)  With  probability  1, 


F^x.^Cx.j) 


Pfx. 


for  k  *  1,  2,  ...  . 


k 
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(iii)  Let  k^,  ....  kf]  be  M  arbitrarily  chosen  integers  such  that 


kl  <  k2  <=  ...  <  kM.  Then 


(Pn(X=xk  IX^  PfX-^  (X^  ))  is  AN(u*  ir*), 

11  M  M 


where 


M*  =  (P,  /F (x^  ^j)»*..»Pk 

1  1  M  M 


E*  =  {aqr}q=l,... ,M 
r=l , . . . ,M 


q-1  r-1 

°qr  ~  Pk  Pk  ^ak.  ,k .+  ak„+k.  ,k .  +ak.,k,,+k.  +akM+k.  ,k,.+k.‘ 

M  q  r  1=1  j=l  l*  j  M  l*  j  i*  M  j  M  l  M  j 


q  r 


+  Pk  j  (ok  +k  ,k.+°k  +k  ,kM+k.)/((PCxk  -1)}  P(xk  -1):) 
r  i=l  M  q*  i  M  q  M  l  r  q 

+  Pk  j  (ok.,k  +ak,.+k.,k  i/(p(xk  -1} (P(xk  -1^  ) 

q  1=1  l*  r  M  l *  r  r  q 


*  Vk  .k  /(f(*k  -l)f(Xk  -l”  f°r  q  5  r- 

M  q  r  q  r 


lim  ^  l  Pk  ±C1-Pk  J  for  q  =  r,  q  =  1,  -  M. 

n-xx.  n  i=l  Kq»  q’ 

1  n 

-lim  i  l  P.  .P.  .  for  q  <  r,  q  =  1,  . . . ,  M,  r  =  1,  .. . ,  M. 

n-*»  n  i=l  V  r’1 

-lim  -Tp,  .  for  q  =  1  +1,  ,  2M,  r  =  1,  . . . ,  M. 

n-K°  i=l  q-M  r  ’ 

i  n 

lim  i  I  R.  .  (1-R,  •)  for  q  =  r,  q  =  M+L,  ....  2M. 

n-*»  i=l  q-M*  q-M’ 

-lim  i  l  R.  .R,  for  q,<  r,  q  =  M+l,  . 2M,  r  =  M+l,  . 

n-*»  i=l  q-M*  r-M’ 


2M, 
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(iv)  Let  k,,  . k„  be  M  arbitrarily  chosen  integers  such  that 

1  M 

k.  <  k_  <  ...  <  k...  Then 
12  M 

(Pn(x>xki)»---.Pn(X>XkM))  is 

where 


*1  M 

JU**  =  .IT  (l-Pi/F(xi  j) TT  Cl-Pi/F(xi  j)) 


{o**}  . 

qr  q=l , . . . ,M 

r=l, . . . ,M 


k  k 


=  if  (1-p  /P(x  ))  TT (1-p ,/F(x  ))  f  f  pyui-p^FCx^pd-pyFCx^ji): 

x=l  j  =  l  J  -t=l  m=l 


for  q  ^  r. 


It  is  evident  from  the  theorem  above  that  the  PEGE  is  a  strongly  consis¬ 
tent  estimator  of  the  underlying  survival  function  if  and  only  if 


(3.4) 


Fl(xk)“Fl(xk-1)  P(x=xk) 


P(\-l> 


P(XSxR) 


for  k  =  1 ,  2 ,  ...  . 


The  theorems  below  give  conditions  under  which  this  equality  holds.  As  for 
correlation,  it  is  evident  from  the  structure  of  the  PEGE  that  any  two  elements 
of  the  sequence  ^n(x>xk)}^_1  are  correlated.  Consequently  the  matrix  E** 
cannot  be  reduced  to  a  diagonal  matrix  under  even  the  most  stringent  conditions. 
However  it  turns  out  that,  under  certain  conditions,  the  asymptotic  correlation 

A  .  QO 

between  pairs  of  the  sequence  {Pn(X=XjJ X>x^) is  zero  --  that  is,  E*  is  a 
diagonal  matrix. 

The  following  theorem  shows  that  independence  between  lifelength  and 
censoring  random  variables  results  in  strongly  consistent  (and  therefore 


•  *  •  •A**/* ’  *.*-*•• ' 

*  •  •  •*/  *.*  -•  v  *  r.  • .  •  .  *  .  • 


asymptotically  unbiased)  estimators.  However  any  pair  in  the  sequence 

co 

is  asymptotically  correlated  in  this  case.  Since  the 
matrices  I*  and  I**  have  the  same  form  as  in  the  theorem  above,  they  are  not 
explicitly  defined  below. 


(Pn(X=xk|X,xk) 


Theorem  3.4. 

Suppose 

(i)  the  random  variables  and  are  independent  for  i  =  1,  2,  ... 

and 

(ii)  there  is  a  distribution  function  H  such  that 

l  n 

lira  i  l  P(Y.Sx.)  =  H(x.)  for  k  =  1,  2,  ...  . 
n-~>  i=l  1  K  K 

Then 

l  k 

(iii)  F1  (xR)  =  ^P(X=x.)fi(xi  l)  and  F(xk)  =  P(X>xk)H(xk)  for  k  =  1,  2, 


(iv)  With  probability  1, 


lira 

n-v» 


for  k  =  1,  2,  ...  . 


(v) 


where  k^  <  k2  <  ...  <  k^  are  arbitrarily  chosen  integers  and 

U*  =  (P(X=x,  |Xix.  ),..., P(X=x.  |X2jc  ))  . 

1  K1  kM  kM 


(vi)  (Pn(X>xki) 


,Pn(X>xk^))  is  AN(w*\±Z**), 


where  k.  <  k,  <  ...  <  kM  are  arbitrarily  chosen  integers  and 


A  sufficient  condition  for  (Al) ,  (A2)  and  assumption  (ii)  of  the 
preceding  theorem  is  that  the  censoring  random  variables  be  identically 
distributed.  In  this  case  the  failure  rate  estimators  are  asymptotically 
independent  and  the  matrix  £**  is  somewhat  simplified.  The  conditions  of 
the  following  corollary  define  the  model  of  random  censorship  widely  assumed 
in  the  literature. 


(v)  (Pn(X>xk^),...,Pn(X>xk^))  is  AN(p**4e**) 


where 


y**  =  (P(X>x.  P(X>x.  )), 

*1  M 

I**  =  {a**}  . 

qr  q=l, . . . ,M 

r=l, .  ..  ,M 

k 

q 

a*Z  =  P(X>x.  )P(X>x.  )  l  P(X=x. |X>x.)/[F(x.  1)P(X>x.|XSx.)J  for  q  s  r. 
qr  Kq  Kr  i=1  XI  l-i  11 

Having  dealt  with  the  most  restrictive  case  in  which  the  lifelength  and 
censoring  random  variables  are  assumed  to  be  independent,  we  now  consider 
relaxing  this  condition.  It  turns  out  that  independence  between  corresponding 
lifelength  and  censoring  random  variables  is  not  necessary  for  asymptotic 
independence  between  pairs  of  the  sequence  of  failure  rate  estimators:  if  the 
censoring  random  variables  are  assumed  to  be  identically  distributed  but  not 
necessarily  independent  of  the  corresponding  lifelength  random  variables,  then 
the  failure  rate  estimators  are  asymptotically  independent.  However  both  the 
survival  function  and  failure  rate  estimators  are  asymptotically  biased.  The 
following  corollary  expresses  these  facts  formally. 

Corollr.ry  3.6. 

Suppose 

(i)  the  random  variables  Y^,  Y^,  ...  are  identically  distributed. 

Then 

(ii)  Pk  =  P(Z=xk,  6=1)  and  F(xk)  =  P(Z>xk)  for  k  =  1,  2,  ..., 

(iii)  (P  (X=x,  |X>x,  ),..., P  CX=x,  |X*x  ))  is  AN(ix*,k*), 

n  K1  K1  n  km  km  ~  n 


where 


y*  =  (P  /F(x.  .),...,  P  /F(x  )), 

•'ii 


pk. C1-pk./^Cxk.-l))/CFCxk.-i))  for  1  =  j 


a* .  =  * 
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for  i  *  j 


(iv)  (Pn(K»t  ),...,pn(x>xk  ))  is  , 

1  M 


where 


rk 


,** 


i=l 


1  kM 

IT  a-p./FCx.p) .  •»TT(i"Pi/FCxi_1)) 


Z**  =  {an}.  , 

3-^  3  ~ !»••«> M 

£=1,...,M 
k .  k 

»;?  - 


j!  -  JTtl-Pi/ffxi-l»n;(l-Pm/P(Vl))  i,! V[!f(Vl»'  CI-VPtxr-l»] 

J  1=1  m=l  r=l 

for  j  s  l. 


The  corollaries  above  give  sufficient  (rather  than  necessary)  conditions 
for  the  two  desirable  properties  of  (i)  consistency  and  (ii)  asymptotic  inde¬ 
pendence  between  pairs  of  the  sequence  of  failure  rate  estimators 
{Pn(X=XjJ Xix^) The  Final  corollaries  show  that  both  of  the  conditions  of 
Corollary  3.5  are  not  necessary  for  these  two  desirable  properties:  the  con¬ 
ditions  specified  in  these  corollaries  are  not  so  stringent  as  to  require  that 
corresponding  censoring  and  lifelength  random  variables  be  independent  (as  in 
Corollary  3.5),  but  rather  that  they  be  related  in  a  certain  way. 


Corollary  3.7. 


If  the  random  variables  Y^ ,  Y2,  ...  are  identically  distributed,  then  v'ith 


probability  1, 


if  and  only  if 


lim  IJn(X>Xjc)  =  G(x^)  for  V.  =  1,  2, 
n-*°  N 


P (Y^  >  |  X  =  x^)  =  P (Y^  a  x^  |  X  2  x^.)  for  k  =  1 ,  2 ,  ...  and  i  =  1 ,  2  ,  ...  . 

Corollary  3.8. 

Suppose 

(i)  the  random  variables  Y^,  Y2 ,  ...  are  identically  distributed 

and 

(ii)  p ^ ! X=xk^  =  P (Y^Sx.  | X2x^)  for  k  =  1,  2,  ...  and  i  =  1,  2,  ...  . 


(iii)  (Pn(X=xk  |X2xk  ,Pn(X=xk  jX2xk  ))  is  ANfy*,iz*), 

11  MM 


where 


=  (P(X=x  |X>x  ),  ...,  P(X=x.  |X>*  )), 

1  1  KM  kM 


{of-}.  ,  ,, 

l]  i=l, . . . ,M 

j~l, . . . ,M 


?(X=xk  |X>xk  )P(X>xk JXlxk  )/F(xk  _p  for  i  =  j. 


11  11 


for  i  *  j. 


(iv)  (Pn(X>xk  ),..., Pn(X>xk  ))  is  AN  (vi* 
1  M  ~ 


inhere 


y**  =  (P(X>xk^),...,P(X>xk^)), 

2**  =  {o*£h  . 

ji  j-1, . . .  ,M 

l- 


K  . 

’*,*  “  P(X>xk  )P(X>xk  )  {  P(X=x.|X>x.)/(P(xi_1)P(X>xi|X2xi)]  for  j 


The  last  two  corollaries  are  of  special  interest  because  they  deal  with 
consistency  and  asymptotic  independence  in  the  case  of  dependent  lifelength 
and  censoring  random  variables  —  a  situation  that  is  not  generally  considered 
despite  its  obvious  practical  significance.  Desu  and  Narula  (1977),  Langberg, 
Proschan  and  Quinzi  (1980)  and  Kitchin,  Langberg  and  Proschan  (1983)  consider 
the  continuous  version  of  the  model  specified  in  the  last  two  corollaries. 

The  condition  specifying  the  relationship  between  lifelength  and  censoring 
random  variables  is  in  fact  a  mild  one:  re-expressing  it,  we  have  the  following 
condition. 


P(X=xk|X>xk,Y.2xk)  P (*rxk) 


tor  V.  =  1 ,  2 ,  ...  and  i  =  1 ,  2 ,  . , 


P(X>xk|X>xk,Y.>xk)  '  P(X>xR) 

This  condition  specifies  that  the  failure  rate  among  these  under  observation 
at  any  particular  age  is  the  same  as  the  failure  rate  of  the  whole  population 
of  that  age.  It  is  evident  both  intuitively  and  mathematically  that  this  is 
a  fundamental  assumption  inherent  in  the  process  of  estimating  a  life  distri¬ 
bution  from  incomplete  data:  if  this  assumption  could  not  be  made,  the  data 
available  would  be  deemed  inadequate  for  estimating  the  life  distribution. 
Formally,  it  is  the  fact  that  the  condition  is  both  necessary  and  sufficient 


for  consistency  that  indicates  that  it  is  minimal  for  the  estimation  process. 

It  is  clear,  therefore,  that  the  last  two  corollaries  play  an  important  role 
in  estimation  in  the  context  of  a  practical  model  more  general  than  the  statis¬ 
tically  convenient,  but  unnecessarily  restrictive,  model  of  random  censorship. 


Section  4:  The  PrGE  Compared  with  Rival s . 

In  Section  1  we  motivate  the  construction  of  the  PEGF:  by  describing  some 


desirable  properties  of  nonparametric  survival  function  estimators  and  then 
mentioning  that  the  commonly  use?  estimator  of  Kaplan  and  Meier  (i958)  does 


:->t  fare  well  in  terms  of  these  properties.  We  now  compare  the  PEGE  with  the 
Kaplan^Meier  estimator. 

We  begin  with  the  most  obvious  desirable  features  of  survival  function 
estimators  and  then  consider  statistical  and  mathematical  properties.  In 
comparing  the  tv/o  estimators,  we  find  that  the  issue  of  continuity  arises  and 
that  the  PEXE  enters  the  comparison.  The  section  ends  with  an  example  using 
real  data.  The  subsequent  section  continues  the  comparison:  we  discuss  the 
results  of  simulation  studies. 

The  Kaplan-Meier  estimator  (KME)  of  the  survival  function  of  the  life- 
length  random  variable  X  is  defined  as  follows: 


P(X>t) 


1 


for  n^  =  0  or  t  <  tj,  n}  s  1. 


i-1 


TT (l-D./nF  (tT)) 
j=l  3  11  3 


for  timl 


s  t  < 


V 


i  «  2, 


n, 


£  2. 


Tt(l-D./nF  (tT))  for  t  2  t  n.  *  1. 
j=l  1  n  3  nl  1 


To  the  prospective  user  of  a  survival  function  estimator,  two  fundamental 
questions  are,  firstly,  does  the  estimating  function  have  the  appearance  of  a 
survival  function,  and  secondly,  is  it  easy  to  compute? 

Considering  the  second  question  first,  we  observe  that  calculating  the 
PEGE  involves  only  marginally  more  effort  than  calculating  the  KME.  Therefore 
both  estimators  are  accessible  to  users  equipped  with  only  hand  calculators. 

The  first  question  is  a  deeper  one.  If  the  sample  is  sm-tll  or  if  there 


are  many  ties  among  the  uncensored  observations  in  a  large  sample,  the  KME 
has  only  a  few  steps  and  consequently  appears  unrealistic.  The  PEGE,  in 
contrast,  reflects  the  continuity  inherent  in  any  life  process  by  decreasing 


; ’i t  every  possible  failure  time,  not  only  at  the  observed  failure  times.  As 
the  number  of  distinct  uncensored  observations  increases ,  both  the  PEGE  and 
the  KME  become  smoother:  the  many  steps  of  the  KME  do  allow  it  the  appearance 
of  a  survival  function,  except  possibly  at  the  right  extreme  —  there  is 
no  way  of  extrapolating  very  far  beyond  the  range  of  observation  if  the  KME 
is  used.  (There  are  several  ways  of  extrapolating  from  the  PEGE.)  At  face 
value,  therefore,  the  PEGE  is  at  least  as  attractive  as  the  KME. 

A  related  consideration  is  whether  the  estimator  provides  a  realistic 
estimate  of  the  failure  rate  function.  The  KME,  being  a  step  function,  does 
not.  The  seriousness  of  this  omission  becomes  more  apparent  when  the  KJT. 
failure  rate  function  is  examined  from  a  user's  point  of  view:  if  an  item  cf 
age  t  has  a  (perhaps  large)  chance  of  failing  at  its  age,  then  claiming  that 
a  slightly  older  (or  slightly  younger)  item  cannot  fail  at  its  age  seems 
unreasonable,  particularly  when  it  becomes  evident  that  the  claim  is  made  on 
the  grounds  that  none  of  the  items  on  test  happened  to  fail  just  after  (or 
just  before)  time  t.  Intuitively  --  or  from  a  frequentist ' 3  point  of  view  -- 
the  very  fact  that  one  of  the  items  on  test  failed  at  time  t  makes  it  less 
likely  that  another  item  in  the  sample  will  fail  soon  after  l  because  che 
observed  failure  times  should  be  scattered  along  the  appropriate  range 
according  to  the  distribution  function.  Clearly,  then,  the  gaps  between 
observed  failure  times  are  a  result  of  the  fact  that  the  sample  is  finite 
and  are  not  indicative  of  zero  (or  very  small)  failure  rates. 

The  P2GE,  on  the  other  hand,  is  constructed  so  that  a  failure  at  time  t, 
say,  affects  the  failure  rate  in  the  gap  before  t.  Thus  the  PEGE  compensates 
for  the  lack  of  observations  at  the  possible  (but  unobserved)  failure  times, 
fhe  resultant  failure  rate  function,  being  a  step  function,  is  still  n3ive,  but 
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ii  Goes  at  least  take  into  account  the  continuity  of  life  processes  and  it  does 
provide  reasonable  estimates  of  the  failure  rates  at  all  possible  failure  times. 

A  more  aesthetic  —  but  none  the  less  important  --  issue  is  that  of  infor¬ 
mation  loss.  Here  the  PEGE  is  again  at  an  advantage.  Although  interval  infor¬ 
mation  about  the  uncensored  observations  is  used  in  spacing  out  the  successive 
values  of  the  KME,  the  failure  rate  estimators  utilize  only  ordinal  information. 
Moreover,  the  only  information  utilized  from  the  censored  observations  is  their 
positioning  relative  to  the  uncensored  observations.  Thus  the  information  lost 
by  the  KME  is  of  both  the  ordinal  and  interval  types.  In  contrast,  the  PEGE 
failure  rate  estimators  use  interval  information  from  all  the  observations:  in 
particular,  the  positions  of  censored  observations  are  taken  into  account 
precisely.  In  terns  of  information  usage,  then,  the  PEGE  is  far  more  desirable 
than  the  KME. 

An  apparently  attractive  feature  of  the  KME  is  that  its  values  are  invar¬ 
iant  under  monotone  transformation  of  the  scale  of  measurement.  The  PEGE  is 
not  invariant  under  even  linear  transformation.  However,  in  the  light  of  the 
discussion  about  information  loss,  it  is  evident  that  the  KMF. 's  invariance,  and 
the  PEGE's  lack  thereof,  are  results  of  their  levels  of  sophistication  rather 
than  properties  that  can  be  used  for  comparison. 

Having  noted  that  the  step  function  form  of  the  KME  is  not  pleasing,  we 
now  point  cut  that  it  is  also  responsible  for  a  statistical  defect,  namely, 
that  the  KME  tends  to  overestimate  the  underlying  survival  function  and  its 
percentiles.  The  fact  that  the  KIE  consistently  overestimates  suggests  that 
its  form  is  inappropriate.  Some  indications  about  the  bias  of  the  PEGE  are 
given  by  considering  the  relationship  between  the  PEGE  and  the  KME. 

Under  certain  conditions  (for  example,  if  there  are  no  ties  among  the 
uncensored  observations),  the  PEGE  and  the  KME  interlace:  within  each  failure 
interval,  the  PEGE  crosses  the  KME  once  from  above.  This  is  not  true  in 
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general,  however.  It  turns  out  that  the  KME  may  have  large  steps  in  the 
presence  of  ties.  In  the  case  of  the  PEGE,  however,  the  effect  of  the  ties 
is  damped  and  the  FEGE  decreases  slowly  relative  to  the  KME.  In  general, 
therefore,  it  is  possible  to  relate  the  PECE  and  tho  KME  only  in  a  one-sided 
fashion:  specifically,  the  PEGE  at  any  observed  failure  time  is  larger  than 
the  KME  at.  that  time.  Examples  have  been  constructed  to  show  that,  in  general, 
the  PEGE  cannot  be  bounded  from  above  by  the  KME.  The  following  theorem 
relates  P  (the  PEGE)  and  P  (the  KME) . 


Theorem  4.1. 

(i)  P(X>tt)  >  PU^)  for  i  =  1,  ...,  nr 
(ii)  If  n?n(t^_l)/(nfn(t^^)  +Vb_1)  ^  0,/D^  for  j  =2,  ...,  i, 

whore  denotes  the  number  of  censored  observations  at  for  j  =>  1,  ...,  nj , 
then  P(X>t^)  S  P(X>t7)  for  i  =  1,  ...,  n^. 

It  is  evident  that  the  condition  in  (ii)  is  met  if  there  are  no  ties 
among  the  uncensoTed  observations;  this  is  likely  if  the  sample  is  small. 

From  the  relationships  in  the  theorem,  we  infer  that  the  bias  of  the  PEGE 
is  likely  to  be  of  the  same  order  of  magnitude  as  that  of  the  KM.  Further 
indications  about  bias  are  given  later. 

Having  considered  some  of  the  practical  and  physical  features  of  the 
PEGE  and  the  KME,  we  turn  briefly  to  asymptotic  properties  --  briefly  because 
the  PEGE  and  the  KME  arc  asymptotically  equivalent  --  that  is. 


Pf(Vk)  lim  P  (X>*k) 


lim  P  (X> 
n 


1. 


The  practical  implication  of  this  is  that  there  is  little  reason  for  strong 
preference  of  either  the  PECE  or  the  Kr£  if  the  sample  ir  very  large. 


Wo  new  compare  the  models  assumed  in  using  the  KME  and  the  PEGE.  In 
the  many  studies  of  tne  KME,  the  most  general  model  includes  the  assumption 
of  independence  between  corresponding  life  and  censoring  random  variables, 
our  most  general  model  does  not  include  this  assumption.  However  this 
difference  is  not  important  because  the  assumption  of  independence  is  used 
only  to  facilitate  the  derivation  of  certain  asymptotic  properties  of  the 
KME:  in  fact,  the  definition  of  the  KME  does  not  depend  on  this  assumption, 
and  the  KME  and  the  PEGE  axe  asymptotically  touivaisiw  under  the  conditions 
of  the  most  general  model  of  the  PEGE.  Therefore  this  assumption  is  not 
necessary  for  using  the  KKE. 

The  other  difference  between  the  models  assumed  is  tint  the  PEGE  is 
designed  specifically  for  discrete  life  and  consoling  distributions  wnile 
the  Kaplan-Meicr  model  makes  no  stipulations  about  the  supports  of  these 
distributions .  However,  distinguishing  between  continuous  and  discrete 
random  variables  in  this  context  is  merely  a  statistical  convention  — 
in  fact,  time  to  occurrence  of  some  event  is  always  measured  along  a  con¬ 
tinuous  scale,  and  the  set  of  observable  taluks  is  always  countable  because 
it  is  defined  by  the  precision  of  measurement.  Since  the  process  of  esti¬ 
mating  a  life  distribution  requires  me asu cements ,  it  always  entails  the 
assumption  of  a  discrete  distribution:  in  ether  the  support  ef  the  estimator 
is  continuous  or  discrete  depends  on  the  way  the  user  oerceives  the  scale 
of  measurement.  In  practice,  therefore,  there  ore  nc  differences  between 
tee  models  underlying  the  PEGL  and  the  KM?,:  the  PEG?  is  appropriate  whenever 
the  KM?  is,  end  vice  versa. 

Having  pointed  out  that  the  PEGS  may  be  used  for  estimating  coeLinuous 
sucvival  functions ,  ani  having  introduced  the  PeXE  as  the  c  niiuuous  counter 
pert  of  the  PEGE,  ve  cc-mp-nve  the  two  Pi  vt  we  note  f  b?t  the  PEXE  is  the 


continuous  version  of  the  PEGE  because  the  construction  of  each  is  based  on 
the  assumption  of  constant  failure  rate  between  distinct  observed  failure 
times.  The  forms  of  the  estimators  differ  because  of  the  difference  in  the 
ways  of  expressing  discrete  and  continuous  survival  functions  in  terms  of 
failure  rates.  The  PEGE  and  the  PEXE  are  equally  widely  applicable  since 
a  minor  modification  of  the  PEXE  can  be  made  to  allow  for  ties.  (This  esti¬ 
mator  is  defined  in  Whittemore  and  Keller  (1983).) 

The  relationship  between  the  PEGE  and  the  modified  PEXE,  and  their  posi¬ 
tioning  relative  to  the  KME,  is  summarized  by  the  following  theorem  and  the 
succeeding  relationship. 

Theorem  4.2. 

Let  P**(X>t)  denote  the  modified  PEXE  of  the  survival  probability 
P(X>t)  for  t  >  0. 

(i)  P(X>t)  <  P**(X>t)  for  t  >  0. 

(ii)  If  nPn(tj_l)/(Dp|1(t^1)  ♦  W^j)  s  Dj/Dj-i  for  j  =  2 . i*  where 

VT  denotes  the  number  of  censored  observations  at  t^  for  j  -  1,  . .. ,  n^, 
then  P**(X>ti)  s  P(X>ti  l)  for  i  =  1,  ...,  n^ 

From  Theorems  4.1(i)  and  4.2(i),  we  have: 

P(X>ti)  5  P(X>tt)  <  P**(X>ti)  for  i  =  1,  ...,  nr 

Consequently,  if  the  condition  in  (ii)  above  is  met  (as  it  is  when  there  are 
no  ties  among  the  uncensored  observations) ,  both  the  PEGE  and  the  PEXE  inter¬ 
lace  with  the  KME:  in  each  interval  of  the  form  (t ^  j,t^],  the  PEGE  and  the 
PEXE  cross  the  KME  once  from  above.  Practical  experience  suggests  that  the 
condition  in  (ii)  above  is  not  a  stringent  one:  even  though  this  condition 
is  violated  in  many  of  the  data  sets  considered  to  date,  the  PEGE  and  the 


PEXE  still  interlace  with  the  KME  in  the  manner  described.  Another  indication 
from  practical  experience  is  that  the  difference  between  the  PEXE  and  the  PE HE 
is  negligible,  even  in  small  samples. 

Finally,  we  present  an  example  using  the  data  of  Freireich  et  al.  (1963). 
The  data  are  the  remission  times  of  21  leukemia  patients  who  have  received 
6  MP  (a  mercaptopurine  used  in  the  treatment  of  leukemia) .  The  ordered  remis¬ 
sion  times  in  weeks  are:  6,  6,  6,  6+,  7,  9+,  10,  10+ ,  11+,  13,  16,  17+ ,  19+ , 
20*,  22,  23,  25+ ,  32+ ,  32+ ,  34* ,  35+ .  The  PEGE  and  the  KME  are  presented  in 
Figure  4.1.  (Since  the  PEGE  and  the  PEXE  differ  by  at  most  .09,  only  the 
PEGE  appears.)  The  graphs  illustrate  the  smoothness  of  the  PEGE  in  contrast 
with  the  jagged  outline  of  the  KME.  The  KME  and  the  PEGE  interlace  even  though 
the  condition  in  Theorems  4.1 (ii)  and  4.2(ii)  is  violated.  Since  the  PEGE  is 
only  slightly  above  the  KME  at  the  observed  failure  times  and  the  PEGE  crosses 
the  KME  early  in  each  failure  interval,  the  KME  is  considerably  larger  than  the 
PEGE  by  the  end  of  each  interval.  This  behaviour  is  typical .  We  infer  that  the 
PEGE  certainly  does  not  overestimate:  it  may  even  tend  to  underestimate. 

We  conclude  that  the  PEGE  (and  the  modified  PEXE)  have  significant  advan¬ 
tages  over  the  KME,  particularly  in  the  cases  of  large  samples  containing  many 
ties  and  small  samples.  It  is  only  in  the  case  of  a  large  sample  spread  over 
a  large  range  that  the  slight  increase  in  computational  effort  required  for 
the  PEGE  might  merit  using  the  KME  because  the  PEGE  and  the  KME  are  likely  to 
be  very  similar. 


In  this  section  we  give  some  indications  of  the  small  sample  properties 


of  the  PEGE  by  considering  three  simulation  studies.  In  the  first  study, 
Kitchin  (1980)  compares  the  small  sample  properties  of  the  PEXE  with  those 
of  the  KME.  In  the  second  study,  Whitteraore  and  Keller  (1983)  consider  the 
small  sample  behaviour  of  a  number  of  estimators:  we  extract  the  results  for 
the  KME  and  a  particular  version  of  the  PEXE.  In  the  third  study,  we  make  a 
preliminary  comparison  of  the  KME  and  the  PEGE.  We  expect  the  behaviour  of 
the  piecewise  exponential  estimators  to  resemble  that  of  the  PEGE  because 
piecewise  exponential  estimators  are  continuous  verions  of  the  PEGE  and,  more¬ 
over,  piecewise  exponential  estimators  and  the  PEGE  are  similar  when  the  under¬ 
lying  life  distribution  is  continuous. 

The  piecewise  exponential  estimator  considered  by  Whittemore  and  Keller 
is  denoted  Fn  .  It  is  constructed  by  averaging  the  PEXE  failure  rate  function 

”4 

estimator  with  a  variant  of  the  PEXE  failure  rate  function  estimator  --  that 

A 

is,  F_  is  the  same  as  the  PEXE  except  that  the  PEXE  failure  rate  estimators 
g4 

X“,  ...,  X~  are  replaced  by  the  failure  rate  estimators  X*  ...,  X*  defined 
1 

as  follows: 

X?  -  2g(X,^*Xj^^)  for  i  —  1,  . ..,  n^ , 


where 


x7  =  D^/Total  time  on  test  in  (t^_j,t^]  for  i  =  1,  ...»  n^, 

X?  =  D^Total  time  on  test  in  Ct-^t.^)  for  i  =  °»  •••*  n^-1, 

f  D  /Total  time  on  test  in  [t  ,»)  if  max  Z.  >  t  . 

♦  I  nl  nl  lsisn  1  nl 


otherwise. 
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Although  Whittemore  and  Keller  include  in  their  study  the  two  estimators 


F-  and  Fn  constructed  from  A' 


A’  and  A, , 
n,  1- 


A  respectively,  thev 
1 


present  the  results  for  the  hybrid  estimator  F  alone  because  they  find  that 

-  i 

Fn  tends  to  be  negatively  biased  and  Fn  tends  to  be  positively  biased. 

^1 


The  same  model  is  assumed  in  all  three  studies.  The  model  is  that  of 
random  censorship:  corresponding  life  and  censoring  random  variables  are  inde¬ 
pendent  and  the  censoring  random  variables  are  identically  distributed.  Whitte¬ 
more  and  Keller  generate  200  samples  in  each  of  the  6x3x4  =  72  situations 
that  result  from  considering  six  life  distributions  (representing  failure 
rate  functions  that  are  constant,  linearly  increasing,  exponentially  increasing, 
decreasing,  U-shaped,  and  discontinuous),  three  levels  of  censoring 
(P(Y<X)  ~  0,  .55,  .76),  and  four  sample  sizes  (n=10,  25,  50,  100).  Kitchin 
obtains  1000  samples  in  each  of  a  variety  of  situations :  he  considers  four 
life  distributions  (Exponential ,  Weibull  with  parameter  2,  Weibull  with  para¬ 
meter  and  Uniform),  three  levels  of  censoring  (P(Y<X)  =  0,  .5,  .67),  and 
four  sample  sizes  (n=  10,  20,  50).  Kitchin's  study  is  broader  than  that  of 
Whittemore  and  Keller  in  that  Kitchin  considers  Exponential,  Weibull 
and  Uniform  censoring  distributions  while  Whittemore  and  Keller  consider  only 
Exponential  censoring  distributions.  Kitchin  apparently  produces  the  greater 
variety  of  sampling  conditions  because  his  results  vary  slightly  according  to 
the  model,  while  Whittemore  and  Keller  find  so  much  similarity  in  the  results 
from  the  various  distributions  that  they  record  only  the  results  from  the 
Weibull  distribution. 

The  conclusions  we  draw  from  the  two  studies  are  similar.  Regarding  mean 
squared  error  (MSE) ,  both  Kitchin  and  Whittemore  and  Keller  find  that,  in 
general : 
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(i)  The  MSE  of  the  exponential  estimator  is  smaller  than  that  of  the 

KHE. 

(ii)  As  the  level  of  censoring  increases,  the  increase  in  the  MSE  is 
smaller  for  the  exponential  estimator  than  for  the  KME. 


Kitchin  reports  that  (i)  and  (ii)  are  not  always  true  of  the  PEXE  and  the  KME: 
the  exceptional  cases  occur  in  the  tails  of  the  distributions. 

The  conclusions  about  bias  are  not  so  straightforward.  Whittemore  and 
Keller  find  that  the  PEXE  tends  to  be  negatively  biased  while  Kitchin  reports 
that  the  bias  of  the  PEXE  is  a  monotone  increasing  function  of  time:  examining 
his  figures,  we  find  that  the  bias  tends  to  be  near  zero  at  some  point  between 
the  40th  and  60t^  percentiles  except  when  the  life  and  censoring  distributions 
are  Uniform.  (In  this  case,  the  bias  is  positive  only  after  the  90^  percentile.) 
We  conclude  that  Whittemore  and  Keller  merely  avoid  detailed  discussion  of  bias. 
Regarding  the  hybrid  estimator,  we  find  in  the  figures  recorded  some  suggestions 
of  the  tendencies  observed  in  the  PEXE  --  specifically,  monotone  increasing  bias 
and  a  tendency  for  underestimation  when  the  sample  size  is  small  and  censoring 
is  heavy.  Whether  this  behaviour  is  typical  of  the  PEGE  also  remains  to  be  seen. 

In  considering  the  magnitude  of  the  bias  of  the  estimators,  we  find  the 
following. 


(i)  Both  Kitchin  and  Whittemore  and  Keller  report  that  the  bias  of  the 
KME  is  negligible  except  in  the  right  tail  of  the  distribution  and  in  the  case 
of  a  very  small  sample  (n=10)  and  heavy  censoring. 

(ii)  The  PEXE  is  considerably  more  biased  than  the  KME. 


(iii)  The  bias  of  ?n  is  negligible  except  in  the  case  of  a  very  small 
sample  and  heavy  censoring. 

(iv)  The  bias  of  each  estimator  increases  as  the  censoring  becomes  heavier 


and  it  decreases  as  the  sample  size  increases. 


In  view  of  these  two  studies,  we  conclude,  firstly,  that  the  PEGE  is 
likely  to  compare  favourably  with  the  KME  in  terms  of  MSE,  and  secondly,  that 
the  PEGE  is  likely  to  be  considerably  more  biased  than  the  KME.  We  expect 
that  the  discrete  counterpart  of  Fn  performs  well  in  terms  of  both  MSE  and 
bias.  Since  the  bias  of  this  estimator  is  likely  to  be  small,  adjustment  for 
its  presumed  tendency  to  increase  monotonically  is  deemed  an  unnecessary 
complication. 

In  the  pilot  study  we  generate  three  collections  of  data,  each  consisting 
of  100  samples  of  size  10,  from  independent  Geometric  life  and  censoring  dis¬ 
tributions.  In  each  case  the  life  distribution  has  parameter  p  =  exp(-.l). 

The  censoring  distributions  are  chosen  so  as  to  produce  three  levels  of  censoring 
setting  p  =  exp(-X),  where  X  =  .00001,  .1,  .3,  yields  the  censoring  probabilities 
P(Y<X)  =0,  .475,  .711  respectively. 

The  conventions  followed  for  extrapolation  in  the  range  beyond  the  largest 
observed  failure  time  are  as  follows: 


f  P£x>t„  )  for  t  s  k  <  s  . 


P(X>k)  = 


n 


n 


1 


no 


P(X>k) 


for  k  2  s  s  t  . 

n2  nl 

k-t 


P(X>t  )(l-a  )  nl  for  kit  , 

1  nl  nl 


This  definition  of  the  KME  rests  on  the  assumption  that  the  largest  observation 
is  uncensored,  while  the  definition  of  the  PEGE  results  from  assuming  that  the 
failure  rate  after  the  largest  observed  failure  time  is  the  same  as  the  failure 
rate  in  the  interval  (t  . ,  t  ] . 

V1  ni 

Our  conventions  for  extrapolation  differ  from  those  of  Kitchin  and  of 


Whitteraore  and  Keller.  Consequently  our  results  involving  right-hand  tail 
probabilities  differ  from  theirs:  a  preliminary  indication  is  that  our  extra¬ 
polation  procedures  result  in  estimators  that  are  more  realistic  than  theirs. 
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Although  the  size  of  the  study  precludes  reaching  more  than  tentative 
conclusions,  we  observe  several  tendencies. 

Tables  1(a),  2(a)  and  3(a)  contain  the  estimated  bias  and  mean  squared 
error  (MSE)  for  the  KME  and  the  PEGE  of  P(X>k)  for  k  =  5  ,  where  is  the 

percentile  of  the  underlying  life  distribution  and  p  =  1,  5,  10,  20,  30, 

40,  50,  60,  70,  80,  90,  95,  99.  From  these  tables  we  make  the  following 
observations. 

(i)  The  MSE  of  the  PEGE  is  generally  smaller  than  that  of  the  KME.  The 
exceptions  occur  in  the  right-hand  tail  of  the  distribution  under  conditions 
of  moderate  and  heavy  censoring. 

(ii)  The  MSE  of  each  estimator  increases  as  censoring  increases. 

(iii)  The  disparity  in  the  MSE  of  the  two  estimators  becomes  more  marked 
as  the  censoring  increases  —  that  is,  the  MSE  of  the  PEGE  increases  by  rela¬ 
tively  little  as  the  censoring  increases,  except  in  the  right-hand  tail. 

(iv)  The  difference  in  the  MSE  of  the  two  estimators  is  smallest  near  the 
median  of  the  distribution. 

(v)  Both  the  KME  and  the  PEGE  generally  exhibit  negative  bias:  the  mag¬ 
nitude  of  the  bias  of  each  estimator  is  greatest  around  the  median  of  the 
distribution. 

(vi)  The  magnitude  of  the  bias  of  the  KME  is  consistently  smaller  than 
that  of  the  PEGE  only  when  there  is  no  censoring.  Under  conditions  of  moderate 
and  heavy  censoring,  the  KME  is  less  biased  than  the  PEGE  only  at  percentiles 
to  the  left  of  the  median:  to  the  right  of  the  median,  the  PEGE  is  considerably 
less  biased  than  the  KME. 

(vii)  As  censoring  increases,  the  magnitude  of  the  bias  of  the  KME  increases 
faster  than  does  that  of  the  PEGE. 


Tables  1(b),  2(b)  and  3(b)  contain  the  estimated  bias  and  MSE  for  the 
Kanlan-Meier  (KM)  and  piecewise  geometric  (PG)  estimators  of  the  percentiles 
£p,  p  =  1,  5,  10,  20,  30,  40,  50,  60,  <70,  80,  90,  95,  99.  From  these  tables 
we  make  the  following  observations. 

(i)  With  a  few  exceptions,  the  PG  percentile  estimator  is  less  biased 
than  the  KM  percentile  estimator. 

(ii)  Both  estimators  tend  to  be  negatively  biased. 

(iii)  At  each  level  of  censoring,  the  bias  of  the  PG  percentile  estimator 
is  negligible  for  percentiles  smaller  than  the  70th,  and  it  is  acceptably 
small  for  larger  percentiles,  except  perhaps  the  99th  percentile.  In  contrast, 
the  KM  percentile  estimators  are  almost  unbiased  only  for  percentiles  smaller 
than  the  60th:  to  the  right  of  the  60th  percentile  the  bias  tends  to  be  very 
much  larger  than  that  of  the  PG  estimators.  This  tendency  is  particularly 
noticeable  in  the  case  of  heavy  censoring. 

(iv)  The  MSE  of  the  PG  percentile  estimator  is  smaller  than  that  of  the 
KM  percentile  estimator  only  in  certain  ranges,  viz:  p  s  70  for  heavy  censoring, 
p  ^  40  for  moderate  censoring,  and  5  £  p  £  95  for  no  censoring.  Since  the  PG 
percentile  estimator  is  almost  unbiased  outside  these  ranges,  the  large  MSE 
must  be  the  result  of  having  large  variance. 

On  the  basis  of  the  observations  involving  the  survival  function  esti¬ 
mators,  we  conclude  that  the  small  sample  behaviour  of  the  PEGE  resembles  that 
of  the  PEXE:  specifically,  when  there  is  little  or  no  censoring,  the  PEGE 
compares  favourably  with  the  KME  in  terms  of  MSE  but  not  in  terms  of  bias.  We 
expect  that  this  is  true  irrespective  of  the  level  of  censoring  when  the 
sample  size  is  larger.  It  remains  to  be  seen  whether  inversion  of  this  general 
behaviour  is  typical  when  the  sample  size  is  very  small  and  censoring  is  heavy. 


It  is  evident  that  increased  censoring  affects  the  bias  and  the  MSE  of  PEGE 
less  than  it  affects  the  bias  and  the  MSE  of  the  KME. 

Our  conclusions  about  the  percentile  estimators  are  even  more  tentative 
because  of  the  lack  of  results  involving  the  behaviour  of  percentile  estimators. 
The  fact  that  the  PG  percentile  estimator  is  almost  unbiased  even  in  the  pre¬ 
sence  of  heavy  censoring,  and  even  as  far  to  the  right  as  the  95th  percentile, 
is  of  considerable  interest  because  the  KM  extrapolation  procedures  are  clearly 
inadequate  for  estimating  extreme  right  percentiles. 

Regarding  the  MSE,  we  note  that,  under  conditions  of  moderate  or  heavy 
censoring,  any  estimator  of  the  larger  percentiles  is  expected  to  vary  consi¬ 
derably  because  there  are  likely  to  be  very  few  observations  in  this  range. 

The  ad  hoc  extrapolation  procedure  for  the  KM  is  expected  to  cause  the  estimators 
of  the  extreme  right  percentiles  to  exhibit  large  negative  bias  and  little 
variation.  In  view  of  these  considerations  and  the  accuracy  of  the  PG  percen¬ 
tile  estimators,  we  conclude  that  the  fact  that  the  MSE  of  the  PG  percentile 
estimator  of  the  larger  percentiles  is  greater  than  that  of  the  KM  percentile 
estimator  is  not  evidence  of  a  breakdown  in  the  reliability  and  efficiency  of 
the  PG  percentile  estimator. 


The  general  indications  of  our  pilot  study  are  that  the  PEGE  and  the 
£ 

discrete  version  of  Fn  are  attractive  alternatives  to  the  KME.  In  view  of  the 

resemblance  between  the  properties  of  the  PEGE  and  those  of  the  PEXE,  the 
* 

results  for  portend  well  for  the  new  discrete  estimator:  we  expect  it  to  be 
almost  unbiased  and  to  be  not  only  more  efficient  than  the  KME  but  also  more 


stable  under  increased  censoring.  Moreover,  we  expect  the  corresponding 
percentile  estimator  to  have  these  desirable  properties  also  because  it  is 
likely  to  behave  at  least  as  well  as  the  PG  percentile  estimator. 


The  properties  involving  relative  efficiency  are  of  considerable  impor¬ 
tance  because  relative  efficiency  is  a  measure  of  the  relative  quantities  of 
information  utilized  by  the  estimators  being  compared.  This  interpretation 
of  relative  efficiency,  and  the  fact  that  heavy  censoring  is  often  encountered 


in  engineering  problems,  makes 


and  its  discrete  counterpart  even  more 


attractive. 
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Table  1.  Results  of  Pilot  Study  Using  100  Samples  of  Size  10,  Geometric 
Cp  =  exp(-*1))  Life  Distribution,  Geometric  (p  =  exp(-*00001))  Censoring 
Distribution  and  P(Y<X)  =  0. 


a)  Survival  Function  Estimators. 

I  Estimated  Bias 


ercentxle 


Estimated  MSE 


PEGE 


(b)  Percentile  Estimators. 

!  Estimated  Bias 


Percentile 


Estimated  MSE 


1.63 

1.63 

1.37 

2.38 
3.88 

5.71 

9.72 

14.37 
22.82 
35.96 

95.37 


cO  |  Cl, 


£•: 


ble  2.  Results  of  Pilot  Study  Using  100  Samples  of  Size  10,  Geometric 
=  exp(-*l))  Life  Distribution,  Geometric  (p=exp(-*l))  Censoring  Distribution 
and  P(Y<X)  *  .475. 

(a)  Survival  Function  Estimators. 


V 


|  Estimated  Bias  i 

Estimated  MSE 

Percentile 

PEGE 

KME 

PEGE 

KME 

1 

1 

-.0018 

.0077 

BBS 

S 

■ 

-.0018 

.0077 

■  p 

10 

.0106 

.0124 

n 

20 

.0094 

-.0042 

.0244  | 

■Eg] 

40 

-.0037 

50 

-.0509 

-.0230 

.0475 

.0601 

60 

-.0442 

.0430 

70 

-.0553 

-.0800 

80 

-.0707 

1 

.0413 

90 

-.0590 

.0151 

95 

.0082 

99 

1 

(b)  Percentile  Estimators. 


Estimated 

Bias 

Estimated  MSE 

Percentile 

PEGE 

KME 

PEGE 

KME 

1 

.00 

.80 

.00 

3.36 

5 

.19 

.80 

.33 

3.36 

10 

-.34 

-.20 

1.66 

2.76 

20 

-.09 

.08 

3.69 

5.36 

30 

.38 

.80 

7.40 

9.84 

40 

.10 

.64 

12.62 

17.24 

50 

.77 

1.43 

20.97 

25.21 

60 

-.20 

.62 

34.24 

37.26 

70 

-.67 

-1.44 

64.85 

36.28 

80 

-.88 

-2.73 

128.02 

52.21 

90 

-1.23 

-8.92 

302.31 

121.66 

95 

-.60 

-14.92 

561.06 

264.70 

99 

-2.30 

-31.92 

1497.30 

1060.98 

Table  3.  Results  of  Pilot  Study  Using  100  Samples  of  Size  10,  Geometric 
(p  =  exp(-*l))  Life  Distribution,  Geometric  (p=exp(--3))  Censoring  Distri 
bution  and  P(Y<X)  »  .711. 

(a)  Survival  Function  Estimators. 


Percentile 


Estimated  Bias 


PEGE 


-.0018 

-.0018 

.0033 

-.0273 

-.0479 

-.1011 

-.1443 

-.2421 

-.2286 

-.1775 


Estimated  MSE 


PEGE 


Percentile  Estimators. 


Percentile 


Estimated  Bias 


PEGE  KME 


Estimated  MSE 


.52 

.68 

1.37 

3.22 

7.19 

15.16 

28.06 

50.99 

90.72 

167.67 

357.58 

619.71 

1508.06 


84.44 

252.63 

474.99 

1496.01 
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